Tripoli University Computer Architecture (EE434) 
EEE-Department Spring - 2013 


Midterm Test # 2 Solution 


Q1-(30pts) 
Signed & Unsigned Multiplication and Division 


a) Multiply X (Multiplicand) = 112 to Y(Multiplier) = 110112, use one of the sequential 
unsigned multiplier versions (S5pts). How many registers are required, what are their 
sizes, and what do they hold? (3pts). How large an ALU is required? (2pts). 

Note (X and Y are 5-bit signed numbers in 2's complement representation). 
Carell Gr pal 8 LE} uya all G pb asl plastid ga Igual laa à Gs lhal 
X=MC=112 Y=MR=110112 
‘2's complement dill å oyisi y Cu 5 aam oye Led X & Y onal of le Aba Dual) å 4p pill ai ad, 
BLE! Cy gt Gr petal Alec pasini SI BLE! Gye oal a pats a sii of dlia lhi ligà 
Bt (5 slurs Gir ge adl) GS Cy 5 J isal 000119 Ge syle soll Ibe X=112 cd 5¥! all 
OSS 9 SAYL aac oll Alisa a slid Gills sand GS) Os 5 Jina aadi laa Y=1101 12 AÙ sacl 
Y=-(00101) =-5 
eds lalina gill 6 LE! oy gay Gr paca Ai g yall Ghall (asl pasini g 8 LEY! Ge af ll ay jad ais 
1st version unsigned multiplication 
2nd version unsigned multiplication 
3rd version unsigned multiplication 
Ca ees gy Gall ihe any 
PR=00000 0 1111 =15 
Aail anai g BLE! are gs a si 
-15 


How many registers are required, what are their sizes, and what do they hold? (3pts). 
How large an ALU is required? (2pts). 
CB ildi pal Aisles La le GSi Gl Gay y diasin gall Ga pail Gus Gle GSi ilay! Lis 


e {st version unsigned multiplication: 
Three registers. 
MC and PR registers are 10 bits. hold Multiplicand and Product. 
MR register is 5 bits. holds multiplier. 
ALU size = 10 bits. 
e 2nd version unsigned multiplication 
Three Registers 
MC and MR registers are 5 bits hold Multiplicand and Multiplier. 
PR register is 5 bits holds Product. 
ALU size = 5 bits. 
e 3rd version unsigned multiplication 
Two Registers. 
MC register size = 5 bits holds Multiplicand. 
PR register size = 5 bits holds Product. 
ALU size = 5 bits. 
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b) Divide X (Dividend) = 100112 by Y(Divisor) = 10112, use one of the sequential divider 


versions (5pts). How many registers are required, what are their sizes, and what do they 
hold? (3pts). How large an ALU is required? (2pts). 
Note (X and Y are 5-bit signed numbers in 2's complement representation). 


Qe piers Aeull plae patty Cia gal adadi Jya Gills gl ag gti y ai YI gle cath uot) Aiya 5 ill 5 SÀ Gull 


celldlls s LYI 
X= 10011 = -(01101)= -13 

Y=01011=+11 

-13/11 


Q sign = Div sign --> Q = -1 

Rem sign = Div sign xor Dr sign = - xor + = 1 xor 0 =1 --> Rem sign is -ve 
Rem = -2 

Div = Q * DR + Rem 

Div = -1 * 11 + -2 = -13 

SAY) 3 ial iilh 

How many registers are required, what are their sizes, and what do they hold? (3pts). 
How large an ALU is required? (2pts). 

1st version 

Three Registers 

DR and Rem resgiters 10 bits 

Q register 5 bits 

ALU 10 bit 

3rd Version 

Two Registers 

DR register 5 bit 

Q and Rem in the same register = 10 bits 

ALU 5 bits 


How many adders (Half and Full) are required to implement a 2bit x 2bit array signed 
multiplication using Bough Wooley's method? (6pts). Calculate the longest gate delay in 
the carry path(4pts). 


leuy ab Cpe gy Ai hll Gl pled Gahi Cpe GY pal Callell's pal lll ase aS 4 ys SI 
Cu 2 SI ihya GSI ce) Aili cy 4 J lbs) 4) hli dull 
A plia itaati LA & y yucll 


X1Y0 x0YO 
ig X1Y1 x0Y1 


Z3 Z2 ZI ZO 
cele daxi oj gai aus sll aes 
1 Full Adder, 2 Half Adder 
Gate Delay in the longest carry path = 7 Gate Delay 
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Q2-(25pts) 
Floating Point Arithmetic 


a) 


oO 
~~ 


Represent (—1101.011 x 2'39) in IEEE-754 single precession format. (5pts) 
Single Precession Format S = 1, E = 8, F= 23 bit 
—1101.011 x 2% = —1.101011 x 2'7 
Val(E)= E-127 --> Val(E) = -127= E - 127 -->E=0 
iala Ula ii E=0 Ll GY Gls y sill ode (4 andl he bed bis YW E=0 ol ly 
Denormalized +Í olai gle a Sie of J lai 
Denormalized format (-1)S x 0.Fx 218 
Denormalized J 4k sai xie aall lis s 
—1.101011 x 2'°’=-0.1101011 x 27% 
Denormalized Number +4) olaf dle aal aa Jäi ghin s 
S=1 E=0 F=1101011 
1 0000 0000 1101 0110 0000 0000 0000 000 


Consider A and B are floating point numbers in (S',E’,F°) format. A=1.001x2° and 
B=1.11101. Perform the following operations, round the result to the nearest even, and 
then represent the result in (S',E°,F°) format. What is the decimal value of the result 
according to this format (S',E°,F°). 
i. A+B (10pts) 
danii 
0 100 000110 
Decimal Value 2.1875 
aag El ghil) Lg glail! Lelia 
i. AxB (10pts) 
danii 
0 010 000100 
Decimal Value 0.53125 


Paay Egh ggat gga 


Q3-(20pts) 


Performance 


You are going to enhance a computer, and there are two possible improvements: either 
make multiply instructions run four times faster than before with speedup 1.15, or make 
memory access instructions run two times faster than before. The performance for 
multiplication hardware before improvement is 5x10° per second. and for memory access 
instructions is 2.5x10° per second. 


If you improve Multiplication hardware the speedup =E= 1.15 
E = Ex_time before/ Ex_time with enhancement 
Multiplication Performance = 5x10 --> 

Ex_time for multiplication = 1/ Multiplication Performance 
Ex_time for multiplication = 20 sec 

Ex_time for memory access = 40 sec 
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i. | What is the overall execution time before improvement. (Spts) 
1.15 = Ex_Time Before /( (Ex_Time Before - f1) + f1/4) 
1.15 = Ex_Time Before /( (Ex_Time Before - 20) + 20/4) 
Ex_Time Before = 115 sec 

ii. | Whatis the overall execution time if you improve only multiplication? (Spts) 
Ex_Time with multiplication enhancement = Ex_time before/ Speedup 
=115/1.15 = 100 sec 


iii. | What will the speedup be if you improve only memory access?(5pts) 
E = 115/ (( 115-40) + 40/2) 
E=1.2015 


iv. What will the speedup be if both improvements are made? (5pts) 
E = 115/ ((115-40-20)+ 40/2 + 20/4) 
E = 1.4375 


Q4-(25pts) 
Single Cycle Datapath and Control 


a) Draw the hardware of the register file shown below. (10pts) 


Registers 





Clock 
RegWrite 8 
Dl À age gall aus ll Cadi ge aill oda Ja 
CH5_5.1_5.4 P2.ppt slide 7 
2:4 Al aaa La OSI; 
Cy 8 9 floss pil aay 


oO 
~~ 


Consider the following single-cycle datapath for the MIPS processor implementing a 


subset of the instruction set (R-Type, Immediate Arithmetic and Logic, LW/SW, jump 
and branch instructions). If the following signals have a stuck at O or 1. Which 


instructions mentioned above will not work correctly? Explain why. 
i. RegDst stuck at 0. (5pts) 


Immediate Arithmetic/Logic and LW instructions will not work correctly 


because the destination register should be chosen depended on 5-bit rt. 
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ii. ALUSrc stuck at 1. (Spts) 


R-type and Branch instructions will not work correctly because busB is need to 
be as input to the ALU to perform these instructions. 


iii. © MemToReg stuck at 1. (Spts) 


LW instruction will not work correctly because we need to store the Data_out 
from data memory to the destination register. 


30 Jump or Branch Target Address 


Instruction 
Memory 


Instruction 


Address 





MemRead 








MemtoReg 





Main 
Control 


Optional Question (25% of the mark obtained). 
What is the dynamic range and the special cases representation for IEEE-754 half precession 
format. 

AGS å gllaall J plal 

Half Precession S=1 E=5 F=10 

Jil sly cts hla dad AUS 

0 00000 0000000000 = +ve zero 

Largest +ve Normalized Number 

Smallest +ve Normalized Number 

Largest +ve De-Normalized Number 

Smallest -ve De-Normalized Number 

Largest -ve Normalized Number 

Smallest -ve Normalized Number 

Largest -ve De-Normalized Number 

Smallest -ve De-Normalized Number 

Special Cases Nan -zero +zero -o0 + o0 
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